Claim Amendments

Claim 1. (currently amended) A computerized storage and retrieval system of biological information 
comprising: 

a means for entering data [entry];

a means for displaying the data; 

a programmable central processing unit for performing automated analysis using semi-linear pathway 
representation (SLIPR) and methods to compare and predict pathways and their components and to establish 
a pathways database; [and] 

a [data storage] means for storing data; and

a relational database containing annotated and curated [protein] pathways and [annotated] information on
the pathways and their components [stored in a relational database, wherein the pathways are annotated and
organized in a curated clustering arrangement and the annotated information is accessed through the
relational database].

Claim 2. (currently amended) The system of claim 1, wherein the information pertaining to the pathways
and their components and [is] stored in a plurality of tables further comprises [comprising] proteins, their
sequences and attributes; protein interactions; protein-protein associations; protein pathways; mRNA,
microarray, and protein expression data; genes, their sequences and attributes; [and] descriptions of cells,
tissues, and organs [,]; pathology reports [,]; and patient histories [,] and treatments.

Claim 3. (currently amended) The system of claim 1, wherein the central processing unit is programmed to 
run an algorithm or a method for constrained clustering or an optimization algorithm for dynamic 
programming and one or more functions or standard methods selected from retrieve, input, edit, annotate, 
search, calculate similarities, align, compare, optimize using linear next-neighbor criteria or global 
minimization criteria, and predict homologous or orthologous [protein] pathways and their components and 
wherein the method for constrained clustering is further selected from average linkage, single linkage, 
complete linkage, K-means, or self-organizing maps.
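For illustration only, and not as part of the claim language, the three linkage criteria named in claim 3 can be sketched as follows. The function name `linkage_distance`, its parameters, and the toy distance function are hypothetical; the claims do not prescribe any particular implementation.

```python
from itertools import product


def linkage_distance(cluster_a, cluster_b, dist, method="average"):
    """Distance between two clusters under a chosen linkage criterion.

    cluster_a, cluster_b: non-empty lists of items (e.g. protein identifiers).
    dist: hypothetical pairwise distance function supplied by the caller.
    """
    # All pairwise distances between members of the two clusters.
    pair_dists = [dist(a, b) for a, b in product(cluster_a, cluster_b)]
    if method == "single":      # closest pair
        return min(pair_dists)
    if method == "complete":    # farthest pair
        return max(pair_dists)
    if method == "average":     # mean over all pairs
        return sum(pair_dists) / len(pair_dists)
    raise ValueError(f"unknown linkage method: {method}")


def toy_dist(a, b):
    # Toy distance: absolute difference of numeric labels.
    return abs(a - b)


single = linkage_distance([0, 1], [4, 6], toy_dist, "single")
complete = linkage_distance([0, 1], [4, 6], toy_dist, "complete")
average = linkage_distance([0, 1], [4, 6], toy_dist, "average")
```

In practice the distance would come from sequence comparison rather than from numeric labels; the toy function is used only to keep the sketch self-contained.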

Claim 4. (currently amended) The system of claim 1, wherein the central processing unit is further 
programmed to perform one or more analyses selected from protein sequence analysis, protein interactions 
analysis, protein-protein association analysis, protein pathway analysis, gene expression analysis, pathway 
annotation analysis, pathway edit analysis, pathway expression analysis, tissue expression analysis, 
subtractive hybridization analysis, electronic northern analysis, and [or] commonality analysis.

Claim 5. (currently amended) The system of claim 1, wherein each [the data is entered using the standard
for] pathway in the database further comprises at least two nodes connected by a mode [representation].
Claim 6. (currently amended) The system of claim 1 [4-], wherein [a means for displaying the data is used to
show two related pathways as a diagram containing nodes which represent proteins or non-protein molecules;
modes that represent protein interactions or protein-protein associations;] at least two pathways are compared
using scores calculated from sequence identity, motif or structural homologies that interrelate nodes [;] and
employing coefficients of similarity calculated or selected from gene expression including clone counting of
cDNAs, ESTs, promoter regions, or regulatory sequences; electronic northern analysis, immunoprecipitation,
microarray data, quantitative real-time polymerase chain reaction, transcript images; and yeast two hybrid
data; a promoter similarity matrix, a protein-protein association matrix and a similarities among protein
interactions matrix (SCIM matrix) that interrelate modes of the pathways.

Claim 7. (currently amended) The system of claim 1, wherein the central processing unit is programmed to
[compare] perform pairwise comparison of at least two [protein] pathways or their components selected from
[by a] node-only, [a] mode-only, and [or a] node-and-mode comparison and wherein the node-only
comparison is further selected from protein only, non-protein only, and protein and non-protein nodes.

Claim 8. (currently amended) The system of claim 3 [4-], wherein the central processing unit [is programmed
to] runs the [an] optimization algorithm for dynamic programming further comprising:

a) initializing an array, in which a two dimensional array M = (M_ij) with J rows of nodes and variant
length for each row, the length for i-th row is n_i, is set up and M_ij = 0, where 1 <= j <= n_i,

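b) backfilling the array via backward recursion with the formula

$$M_{ik} = \max_{j,l}\,\bigl\{\, w(a_{ik}, a_{jl}) + M_{jl}\,\varepsilon\bigl(w(a_{ik}, a_{jl})\bigr) \bigr\} \quad \text{for } 1 \le k \le n_i,\ 1 \le i \le J$$

where ε(.) is the step function defined as ε(v) = {0, if v <= 0; 1, if v > 0} and w(.,.) is the scoring function
between the two nodes, defined as

$$w(a_{ik}, a_{jl}) = \begin{cases} 0, & \text{if } i = j,\ a_{ik} = a_{jl},\ a_{ik} = -D,\ \text{or } a_{jl} = -D,\ \text{where } D > 0, \text{ or} \\ \alpha\,\bigl[\,C_{ik,jl} - \kappa\,|S_{ik} - S_{jl}|\,\bigr] + (1 - \alpha)\,C_{ik,jl}, & \text{otherwise} \end{cases}$$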



c) [using traceback to identify] finding the best alignment to predict nodes of putative pathways PPW_j,
1 <= j <= max n_j, with the top n best scores.
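For illustration only, and not as part of the claim language, the general idea of scoring an alignment of two node sequences by dynamic programming can be sketched as below. This is a standard Needleman-Wunsch style global alignment, swapped in for simplicity; it is not the backward recursion recited in claim 8. The names `align_score`, `toy_score`, and the gap penalty are hypothetical.

```python
def align_score(query, target, score, gap=-1.0):
    """Best global alignment score between two node sequences.

    query, target: lists of node labels (e.g. protein identifiers).
    score: hypothetical node-to-node scoring function supplied by the caller.
    gap: assumed penalty for inserting or deleting a node.
    """
    rows, cols = len(query) + 1, len(target) + 1
    m = [[0.0] * cols for _ in range(rows)]
    # Aligning a prefix against nothing costs one gap per node.
    for i in range(1, rows):
        m[i][0] = i * gap
    for j in range(1, cols):
        m[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            m[i][j] = max(
                m[i - 1][j - 1] + score(query[i - 1], target[j - 1]),  # match or mismatch
                m[i - 1][j] + gap,  # node of query left unaligned
                m[i][j - 1] + gap,  # node of target left unaligned
            )
    return m[-1][-1]


def toy_score(a, b):
    # Toy scoring: +1 for identical labels, -1 otherwise.
    return 1.0 if a == b else -1.0


partial = align_score(["A", "B", "C"], ["A", "C"], toy_score)
identical = align_score(["A", "B", "C"], ["A", "B", "C"], toy_score)
```

A scoring function built from sequence identity and coefficients of similarity, as the claim describes, would replace `toy_score` in any realistic use.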

Claim 23. (currently amended) The system of claim 3 [+], wherein the central processing unit [is
programmed to run a] runs the constrained clustering algorithm further comprising:

a) assigning a distance between a protein in a query pathway and each protein [every pair of proteins] in
the database;

b) merging at each round the [two] protein in the query pathway and the protein in the database with the 
closest distance [pairs] into a cluster until user-set threshold is met; and 

c) computing mean linkage between two clusters using



where [n] n_i is the size of c_i and n_j is the size of c_j and the distance between the two clusters is a weighted
average of the distance between two proteins i and j and a function of similarity to query protein (s_i = S_{i q(i)},
s_j = S_{j q(j)}) where q(i) is the query protein homologous to database protein i and protein-protein association
(a_ij).

Claim 24. (canceled)

Claim 25. (currently amended) The system of claim 7, wherein [pathway analysis uses a] node-and-mode
comparison predicts new homologous or orthologous pathways by [comprising:
a) submitting a query pathway and protein sequences;

b)] comparing nodes using an optimization algorithm for dynamic programming or a constrained
clustering algorithm or standard methods and [wherein a sequence identity score or p value summarizes
similarity and a weighting factor between 0 and 1 is assigned to corresponding nodes;

c) comparing] modes by generating a SCIM matrix [wherein a] containing coefficients of similarity [is
assigned to corresponding modes;

d) aligning pathways globally or locally, wherein insertion or deletion of nodes or modes incurs a
penalty;

e) summing all similarity scores; and

f) displaying at least one high scoring segment of the aligned pathways derived from node and mode
comparison, thereby performing pathway analysis].

Claim 26. (currently amended) A process for [The system of claim 1, wherein the central processing unit is
programmed to perform] analyzing or predicting novel homologous or orthologous pathways [analysis using
a submitted query pathway and protein sequences] comprising:

a) submitting a query pathway and protein sequences;

b) organizing the pathway and sequences using SLIPR [the standard for pathway representation];

c) [b)] comparing each [protein] sequence [s] of the query pathway with each [all protein] sequence [s]
in the pathways database using an algorithm for constrained clustering or an optimization algorithm for
dynamic programming and standard methods of protein comparison to calculate sequence identity scores;

[c)] d) using a SCIM matrix derived from protein interactions or protein-protein association analysis to
compare coefficients of similarity for each interaction of the query pathway and each [all] interaction [s] for
[proteins in] a pathway in the pathways database;

e) [d)] calculating an overall-similarity score (OS-score) based on sequence identity scores and
coefficients of similarity for each pathway; and

f) [e) removing all pathways not meeting a user specified threshold for OS score; and] retrieving
[aligned] pathways meeting a user-specified [the] threshold for OS-score, thereby [performing] analyzing or
predicting novel homologous or orthologous pathways [analysis].
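For illustration only, and not as part of the claim language, steps e) and f) of claim 26 can be sketched as below. The claim does not give a formula for the OS-score; the weighted mean used here, the weight `alpha`, and the names `os_score` and `retrieve_pathways` are assumptions made for the example.

```python
def os_score(identity_scores, similarity_coeffs, alpha=0.5):
    """Combine node-level and mode-level evidence into one score.

    identity_scores: sequence identity scores for matched proteins (assumed in [0, 1]).
    similarity_coeffs: coefficients of similarity for matched interactions (assumed in [0, 1]).
    alpha: assumed weight between 0 and 1; not specified by the claim.
    """
    mean_identity = sum(identity_scores) / len(identity_scores)
    mean_coeff = sum(similarity_coeffs) / len(similarity_coeffs)
    return alpha * mean_identity + (1 - alpha) * mean_coeff


def retrieve_pathways(candidates, threshold, alpha=0.5):
    """Return the candidate pathways whose OS-score meets the user-specified threshold."""
    scored = {
        name: os_score(ids, coeffs, alpha)
        for name, (ids, coeffs) in candidates.items()
    }
    return {name: s for name, s in scored.items() if s >= threshold}


# Hypothetical candidates: name -> (identity scores, coefficients of similarity).
candidates = {
    "pw1": ([0.9, 0.7], [0.8, 0.6]),
    "pw2": ([0.2, 0.4], [0.1, 0.3]),
}
hits = retrieve_pathways(candidates, threshold=0.5)
```

With these toy values only `pw1` passes the threshold; any other monotone combination of the two score types would fit the claim wording equally well.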

Claim 27. (currently amended) The system of claim 4 [3], wherein [pathways database is searched for
protein interactions comprising:
a) submitting a query pathway;

b) performing] protein interactions analysis between a [the] query pathway and each [all] pathway [s] in
the pathways database produces a [wherein] coefficient of similarity [is produced] to interrelate each mode of
the query pathway and a mode of the most closely related protein pathway [;

c) retrieving at least one pathway alignment; and

d) showing the alignment based on the search for protein interactions].

Claim 28. (canceled) 

Claim 29. (currently amended) The process of claim 26, wherein [The system of claim 3, wherein a query
pathway to search a pathways database to predict] the predicted orthologous pathway [s comprising:

a) submitting a query pathway and known protein sequences;

b) comparing known sequences to all protein sequences stored in the database;

c)] retrieving further comprises orthologous proteins with the highest sequence identity to the [known]
proteins in the query pathway [;] and

[d)] inheriting protein interactions inherited from the query pathway [; and

e) aligning the query pathway and the orthologous proteins, thereby predicting orthologous pathways].

Claims 30-36 (canceled) 